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Abstract: 

The goal of this research has been to understand the computational structure of 
principle-and-parameter linguistic theories: what computational problems do these 
theories pose and what is the underlying structure of those computations? To do 
this, I have analyzed the computational problem of human language comprehension: 
what linguistic representation is assigned to a given sound? This language com¬ 
prehension problem may be factored into smaller, interrelated (but independently 
statable) problems defined on partial phonological, morphological, and syntactic 
representations. For example, in order to understand a given sound, the listener 
must assign a phonetic form to the sound; determine the morphemes that compose 
the words in the sound; and calculate the linguistic antecedent of every pronoun in 
the utterance. I prove that these and some other subproblems are all NP-hard, and 
that language comprehension is itself PSPACE-hard, according to current linguistic 
theory. 
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1 Introduction to the Reductions 


If linguistic theory is a theory of the human language ability, as rational sci¬ 
entists must claim, then linguistic theory must have empirical consequences. 
Some of those consequences will be computational. At the very least, lan¬ 
guage users must use the linguistic representations extensionally character¬ 
ized by the generative grammars of modern linguistics. This in and of itself 
states a class of computational problems: what linguistic representation is 
assigned to a given sound, what articulations express a given underlying lin¬ 
guistic representation, and what is the internal state of a language learner af¬ 
ter a given sound? Each of these problems—comprehension, production, and 
acquisition—may be profitably factored into smaller, interrelated (but inde¬ 
pendently statable) problems defined on partial linguistic representations. 
The language comprehension problem includes as sub-problems the phono¬ 
logical problems of determining stress, tone, and syllable, morphological, 
and articulatory structure; the syntactic problems of detecting empty cate¬ 
gories, disambiguating words, determining phrase and argument structures, 
and computing the linguistic antecedents of anaphora. More abstractly, 
these sub-problems of comprehension consist of completing an independently 
defined portion of an incomplete linguistic representation. 

The goal of my doctoral research has been to determine the computational 
structure of current linguistic theory: what problems does the theory pose 
and what is their underlying structure? To do this, I have studied abstract 
problems in language comprehension with computational complexity theory, 
a theory of the absolute difficulty of solving computational problems, in 
terms of their natural parameters. The major technical result of the thesis 
is that these computational problems are all intractable and cannot be solved 
in practice. 

The greatest difficulty of the research lies in creating precise linguistic mod¬ 
els from the inconsistent, incomplete, and imprecise linguistics theories that 
are available today. These models must be sufficiently abstract to capture 
the core ideas of modem linguistics in order to remain relevant despite rapid 
change in the field. My approach has been to reduce the instances of some 
known problem P into a class of linguistic phenomena such that under¬ 
standing that phenomena corresponds to solving P. As far as I know, the 
reductions hold for all current linguistic theories that explain these phenom¬ 
ena. This places these proofs among the strongest formal results achievable 
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2 The Complex Structure of Sounds and Words 


Words are built from units of meaning called morphemes. In the more fa¬ 
miliar languages of the world, such as Romance languages, morphemes are 
concatenated to form words: this process is ailed affixation. In some lan¬ 
guages, such as Semitic languages where vowels are morphemes, a morpheme 
may appear more that once inside another morpheme (this is called infixa¬ 
tion). For example, the Arabic word katab , meaning ‘he wrote’, is formed 
from the active perfective morpheme a infixed to the ktb morpheme. 

In addition to morphemes, words consist of syllables. Each syllable contains 
one or more vowels V (its nucleus) that may be preceded or followed by 
consonants C. For example, katab consists of two syllables, the two-segment 
syllable CV and the three-segment closed syllable CVC. 

Syllable structure can interact with morphological structure. For example, 
all Arabic words must end in the closed syllable CVC, and therefore the a 
morpheme must be infixed to the ktb morpheme in order to form a word with 
permissible syllable structure. It cannot be prefixed, because Arabic does 
not permit syllables of the form VCCC; nor can it be suffixed, because CCCV 
syllables are not permitted. These interactions appear simple, but they 
can become extremely complex when morphemes are ambiguous or when 
segments are underspecified as in distorted speech. In fact, determining 
the morphological and syllable structure of an underspecified sound is like 
solving the satisfiability problem for Boolean formulas, where one-segment 
morphemes are Boolean variables, vowels true literals, and consonants false 
literals. Then three-segment syllables would correspond to satisfied 3-CNF 
clauses, and words to entire 3SAT instances. 

Among the world’s languages, assimilation is one of the most common 
phonological processes. Assimilation is the process whereby some segment 
comes to share properties of another segment in the course of the derivation 
of surface forms from underlying forms. For example, in English, conso¬ 
nant nasality assimilates to immediately preceding vowels; assimilation also 
occurs across morpheme boundaries, as the varied surface forms of the pre¬ 
fix /in-/ demonstrate: /in-/ +/logical/ —► [illogical] and. /in-/+/probable/ 
—> [improbable ]. In other languages, assimilation is unbounded and can af¬ 
fect nonadjacent segments: these assimilation processes are called harmony 
systems. In the Turkic languages all suffix vowels assimilate the backnesss 
feature of the last stem vowel; in Capanahua, vowels and glides that pre- 



cede a word-final deleted nasal (an underlying nasal segment absent from 
the surface form) are all nasalized. 

Harmony systems are powerful computational mechanisms. In fact, we can 
virtually duplicate the preceding reduction idea merely by replacing mor¬ 
pheme infixation with harmonic features. In the second reduction then, 
harmonic features will represent Boolean variables, and harmony processes 
will ensure consistent truth assignments to the Boolean variables; the other 
details are essentially unchanged. 


2.1 Word Recognition in the Nonlinear Model 

In phonological theory, segregated representations obey simple principles; 
this section examines the complexity of these interacting representations 
(planes and tiers) and morpheme combination (affixation and infixation). 
In order to do this, we pose the word recognition problem: 

Word Recognition Problem (WORD) 

Given an underspecified timing slot vector V, and a 
morphological model M, is V a permissible word according to 
M? 


The timing slot vector is an abstract representation of a speech sound in 
terms of the articulations (phonemes) necessary to produce that sound. It 
is underspecified when articulations necessary to produce a word of the 
language are missing from the representation. 

Theorem 1 WORD is NP-hard in the rule-free nonlinear morphological 
model. 

Proof. The proof is by a reduction from 3SAT to WORD. The reduction 
input is a 3-CNF formula F containing clauses G\, C 2 , ..., C p in the Boolean 
variables x lt X 2 ,... Each clause contains exactly three distinct literals 
labeled by C, = (a* V 6; V c,). The output is a rule-free nonlinear model M 
(list of morphemes, feature geometry, and syllable templates) and an under- 
specified timing slot vector V, such that V is a permissible word according 
to M if and only if F is satisfiable. 
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Each variable 2 , is represented by a one-segment morpheme ,, and each 
clause Cj is represented by a three-segment syllable crj. The reduction con¬ 
structs the morphemes P 1 P 2 ■ ■ . Pm plus two features: [±TRUE] to represent 
the truth value assigned to a variable, and [±NEG] to represent whether a 
literal is negated or not. Segments specified with the features [+NEG.-TRUE] 
or [-NEG.+TRUE] represent true literals, which correspond to vowels. For 
example, the false literal 2 ^ is represented by the morpheme ^4 with the 
feature vector [+NEG,+TRUE]. 

Syllable templates ensure that each 3-CNF clause contains one or more 
true literals by requiring syllables to span three segments and contain one 
or more true literals (FFT, FTF, FTT, TFF, TFT, TTF, TTT). In other 
words, we are simply excluding FFF, one of eight possible three-segment 
Truth templates,’ which corresponds to positing a language where syllables 
span exactly three segments and must include a vowel. 

The timing slot vector V contains the string of formula literals, where each 
formula literal is represented by a morpheme and the appropriate setting of 
the NEG feature. 

1 + n distinct planes are needed: a single one-tiered plane for each morpheme 
qua variable, and one two-tiered plane for syllables qua satisfied 3-CNF 
clauses. The reduction requires 0(n) time and space to specify the nonlinear 
model and does not use any morphological rules, which strongly suggests 
that the nonlinear model is more powerful than A/’PQ 

2.2 Sound Recognition in the Nonlinear Model 

Sound Recognition Problem (SOUND) 

Given an underspecified timing slot vector V, and a 
phonological model M, is F a permissible phonological 
representation according to Ml 

Theorem 2 SOUND is NP-hard in the rule-free nonlinear phonological 
model. 

Proof. The proof is by reduction from 3SAT to SOUND, and follows 
the preceding proof of theorem 1 in its idea. Phonological harmony will 
replace morpheme infixation as the linguistic process that ensures consis¬ 
tent Boolean truth assignments. As before, the reduction input is a 3-CNF 
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t 


— [3-clause] ■ 


[T] -[ p i-[FI 



[T]-[T]-[F] 


[a,-neg] — [b,+neg] 


c ,-negj — [a,-neg] — [b,-negj—[c,-neg 



Figure 1: Nonlinear morpho-phonological representation of the 3SAT formula / = 
(a V 6 V c) A (a V b V c), satisfied by a, b true and c false. The timing slot vector, 
drawn horizontally in the center of the figure with a heavy line, represents the 
string of formula literals. Each of the three one-segment morphemes, drawn below 
the timing slot vector with noncrossing association lines, resides on its own plane 
and represents a formula variable. The final two tiers, drawn above the timing slot 
vector, represent satisfied 3-CNF clauses: one T,F-tier for literals and a second tier 
for satisfied 3-CNF clauses. 
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formula F and the output is a (partial) rule-free nonlinear phonological 
model M (rooted feature geometry, syllable templates, and list of harmony 
processes) and an underspecified timing slot vector V, such that V is a 
permissible phonological representation according to M if and only if F is 
satisfiable. 

Each variable X{ is represented by a binary place-of-articulation feature p ; ; 
the truth assignment to r; is encoded in the binary harmonic articulatory 
feature /;; and each clause Cj is represented by a syllable aj spanning three 
segments. The reduction also constructs the binary feature [±NEG] to rep¬ 
resent whether a literal is negated or not. Each harmonic feature is imme¬ 
diately dominated by a place-of-articulation feature p„ which is connected 
to the root: 



Ml Ml Mi [f 2 i].M) [f„i] 

Segments specified with the features [pfi ,/;0,+HEG] or [p;l,/;l,-NEG] rep¬ 
resent true literals of the variable x t , and will correspond to vowels. For ex¬ 
ample, the false literal Z 4 is represented by the feature matrix [p 4 1, / 4 1, +NEG] 

Being harmonic, each feature will receive the same value throughout the 
timing slot vector. As before, syllable templates ensure that each 3-CNF 
clause contains one or more true literals by requiring syllables to span three 
segments and contain a true literal. The timing slot vector V contains 
the string of formula literals, where each formula literal is represented by its 
place of articulation feature p, and the appropriate setting of the NEG feature. 
For example, the negated literal rj is represented by the underspecified 
feature matrix [p 4 l, +NEG] . | | 


Comments. The reductions to the nonlinear model are blocked: 
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• If planar interaction is limited to superposition of tier features. In this 
highly restricted nonlinear model, each articulatory feature can be 
associated with at most one plane: planes are truly computationally 
independent because they interact only in that they each may affect 
the acoustic signal. But this nonlinear superposition model is far too 
impoverished for natural language, where all phonological processes 
have access to one small fixed set of articulators and hence must make 
maximal use of their representational possibilities (McCarthy, 1981). 
For example, consonants and vowels are segregated into distinct em¬ 
planes and are representationally distinguished by the non-phonetic 
[consonant] feature. But consonants and vowels must both use the 
same set of phonetic features in articulation. They are distinguished 
in articulation only by the obstruction or nonobstruction of the air 
flowing from the lungs to the lips. (Consonants require an articulator 
to contact the stationary part of the vocal tract.) But “this striking 
difference between the production of vowels and consonants must not 
be allowed to obscure the obvious fact that for the production of both 
types of sounds speakers have at their disposal only a single piece of 
anatomical machinery, the vocal tract with its six articulators. . .” 
(Halle, 1987). 

(Nonetheless, the idea of the second reduction is problematic, because 
every segment’s [consonant] feature is directly affected by some har¬ 
monic feature in that segment. I do not know of any harmony processes 
with this character.) 

• In the reductions, the timing slot vector, which is an abstract rep¬ 
resentation of the acoustic signal, contains sufficient information to 
identify each literal/morpheme/place-of-articulation, while systemat¬ 
ically omitting the feature that encodes truth assignment. At first 
glance this is plausible because, as is well known, human speech is 
fantastically resistant to distortion, no matter how underspecified the 
acoustic signed may be. 

One way to block the reductions would be to require the nonlinear 
representation to be stable with small changes in perceived phonetic 
features. As things stand, this is not a property of the preceding 
reductions, which would fail because we could not ensure segregation 
of formula variables or consistency of truth assignments in the presence 
of noise. We might meet this stability requirement by using more 
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phonetic features, to redundantly identify variables and their truth 
values. 

• We might disallow repeated morpheme infixation or universally bound 
the number of harmony processes by a small constant. Both infixa¬ 
tion and harmony are bounded in known languages: no language is 
thought to have more than two harmony processes, and morpheme 
infixation appears to be limited to infixing one vowel per morpheme. 
Unfortunately, there is no elegant way to explain boundedness beyond 
mere stipulation in an approach to linguistic theory that distinguishes 
knowledge and ability, as generative linguistics does. 

2.3 Postmortem 

The aim of the preceding analysis was to determine the complexity of planar 
interactions, harmony processes, and morpheme combination in the non¬ 
linear model. The reduction very clearly shows that the simplest planar 
interactions are complex, regardless of (the potential complexity of) both 
interplanar computations, such as stress shift, and determining the corre¬ 
spondence between surface and underlying forms. With respect to mor¬ 
pheme infixation and phonological harmony, we can see that both processes 
are costly when coupled with articulatory underspecification. 
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3 Structure of transformational models 

The number of levels of linguistic representation, their qualities, and the 
rules relating them are central concerns in generative linguistics. Within the 
principles and parameters approach, syntactic theories have been broadly 
classified both by the levels of representation they employ (S-structure and 
PF, and sometimes D-structure, NP-structure, and/or LF) and by the rule 
determining the relation between a trace and its antecedent (move-alpha 
versus ‘rules of construal,’ such as Rizzi’s chain formation algorithm, Kayne’s 
g-projection, or Roster’s dynasty formation). 

Transformational theories include D-structure and S-structure representa¬ 
tions, and hold that S-structure is derived from D-structure by successive 
move-alpha transformations, where each application of move-alpha uniquely 
relates a trace to its antecedent, and where all relations between traces and 
their antecedents determined by such a derivation are represented at S- 
structure. 1 


3.1 Empirical properties of move-a 

It has been difficult to find empirical arguments favoring the transforma¬ 
tional approach over competing ‘representational’ approaches. Arguments 
based on exhibiting natural properties of D-structure that are difficult to 
state at S-structure have not been entirely persuasive because S-structure 
(redundantly) represents all information represented at D-structure, due to 
trace theory and the projection principle. Thus, natural properties of D- 
structure will also be naturally statable at S-structure in terms of traces. 

Arguments for transformational theories based directly on empirical proper¬ 
ties of move-alpha derivations have been the most persuasive, simply because 
theories lacking move-alpha derivations will lack those empirical properties. 
Such arguments exhibit properties of move-alpha or entire derivations that 
are difficult to state as S-structure rules of construal between traces and 

1 Modern transformational theories are often misleadingly named ‘derivational’ in con¬ 
trast to the competing ‘representational’ approaches. These widely-used appelations are 
misleading because all generative linguistic theories admit explicitly specified represen¬ 
tations, that is, representations derived according to some effective procedure. The only 
distinquishing issues are the role of move-alpha in the derivation of syntactic representa¬ 
tions, and the status of D-structure, NP-structure, and LF in the theory. 
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their potential binders. For example, in his Fall 1987 class lectures Chom¬ 
sky argued, with empirical evidence, that derivations must obey a ‘least 
effort’ principle: that is, the number of derivation steps required to relate 
a trace to its S-structure binder must be minimal. Such global properties 
of derivations are thought, without proof or supporting argument, to be 
difficult to express as S-structure rules of construal. 

Another powerful argument for move-alpha is that the conditions on move- 
alpha can be satisfied neither at D-structure nor S-structure, but only at 
an intermediate stage of the derivation. This obtains whenever an appli¬ 
cation of move-alpha disturbs the obligatory structural relation (typically, 
c-command) between a previously moved constituent and its trace, as when 
a constituent /3 containing a trace t± is moved outside the c-command domain 
of its antecedent a;, as in (1). 


b. [|j3...tj...]j...[ ]] 

Then the relation between the trace t; and its binder a, will not obey the 
same structural conditions at S-structure as other traces and their binders 
do. In (lb), t{ is neither c-commanded nor bound by its antecedent at S- 
structure, yet the resulting S-structure is surely permissible, as in passive 
VP topicalization structures: 


( 2 ) 


a. 


John, has never been [yp arrested t, by the FBI] 


[vp arrested ti by the FBI j John, has never been tj\ 

(The complexity proofs below will rely on this type of “rolling movement” 
construction.) The choice between transformational and alternate theories 
ultimately reduces to the necessity of representations intermediate between 
D-structure and S-structure: if intermediate representations are necessary, 
as has been argued in the literature, then syntactic representations must 
include ordered transformational derivations. 


3.2 Conceptual objection to transformations 

Transformational grammars (TGs) seek to explicitly relate linguistic struc¬ 
tures via a sequence of transformations. For example, in transformational 



theories, a transformation relates an active sentence to its passive variant. 
But there is a major conceptual problem with this approach to linguistic 
theory: TGs do not construct explicit representations. In a TG, linguistic 
relations are implicitly encoded both in the underlying and surface forms 
and in the series of transformations required to derive the surface form from 
its underlying form. In effect, transformations serve double duty, both as a 
linguistic representation and as the derivational rules of a model of compu¬ 
tation. 

For example, in transformational grammars elements are nonlocally related 
in a derivation of syntactic forms using the unbounded one-to-one move-a 
transformation, locally bounded by linguistic principles such as the empty 
category principle (ECP) and chain condition. The representations assigned 
by these theories (chains) include intermediate constituents (traces) whose 
only apparent role is to allow the iterated local representation of nonlocal 
relations. Intermediate traces represent only the history of the computation 



of nonlocal relations. 2 


As a consequence of this failure to separate representation and the process 
whereby it is constructed, important linguistic relations, such as the nonlocal 
antecedent-trace relation, are obscured by the transformational system. This 
is also the reason why the transformational component of current linguistic 
theories (move-alpha) bears so no resemblance to the rest of the theory (that 
is, to phrase and argument structure; to case, control, binding, and theta 
relations; or to the lexicon). 

Why should a linguistic theory seek to hide the structure of linguistic rep¬ 
resentations in order to explicitly encode the derivational relations between 
those representations? This corresponds to statement of the 3SAT prob¬ 
lem in terms of truth assignments and rides that derived all possible 3SAT 
instances satisfied by a common truth assignment from that given truth 
assignment. What would be the point? The best statement of a problem 

2 These intermediate traces are the result of a particular conception of the ECP as a local 
bound on move-a. I would be more persuaded of their existence if they interacted with 
other grammatical components. For example, they might have binding or phonological 
effects. Adjunct traces may satisfy the ECP only via antecedent government at LF; as 
a consequence, adjunct extraction results in intermediate traces that may not be deleted 
at S-structure. Thus, the only intermediate traces required at S-structure are the traces 
of adjunct extraction, but these non-case-marked traces do not block want-i-to —► wanna 
contraction, which is only blocked by case-marked elements (Chomsky 1986a:162). For 
example: 

(3) how do you wanna solve the problem? 

As expected, the intermediate traces in specifier of CP and adjoined to VP do not block 
phonological contraction. Neither do these intermediate A-traces affect binding relations, 
whose domain is NPs in A-positions: 

(4) [which woman]; did John [ V p‘ dream 

[cp < 2 [ip Bill [ 1° [vp t J [ V p saw j j [pp with t;]]]]]]] 

The governing category of the direct object is IP (the complete functional complex), and 
therefore the c-commanding trace t] adjoined to VP could bind the anaphor in object 
position within its governing category, if the trace were in an A-position. But, as ex¬ 
pected, herself is in fact ^unbound, which strongly suggests that tj is only relevant to the 
computation of nonlocal A-movement as constrained by the ECP. The precise formulation 
of the ECP, and the existence of the intermediate traces it requires, is the topic of much 
active research and debate. But the fact that these intermediate traces do not enter into 
other syntactic relations casts doubt on their explicit syntactic representation, at least in 
my mind. 



the stratum they *u« (by aeeidcat, m it **te). 
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4 The Complex Structure of Sentences 


The syntactic structure underlying even simple constructions can be quite 
intricate. For example, according to current syntactic theory, the simple 
passive sentence John was seen is assigned a complex syntactic structure, 
whose phrase structure component follows: 

(5) [ip [np John] i [was [ V p see+en t;]]] 

Most interestingly, each word interacts strongly with every other word in 
the sentence. The passive verb see+en selects the underlying object tj and 
assigns it a ‘patient’ thematic role; the underlying object appears as the 
surface subject John ; the subject agrees with the auxiliary verb was, which 
assigns it nominative case; and, to complete the circle of interactions, the 
auxiliary verb selects the passive verb. These properties of words, such 
as case-marking, thematic role assignment, selection and agreement, are all 
independent, not directly deducible from the phonological form of the words, 
and potentially correlated in the lexicon. 

It is easy to see that interactions among the words in a sentence can po¬ 
tentially become extremely complex. For example, imagine if the lexicon 
contained two slightly different verbs, seei and see 2 , with the same phono¬ 
logical form (homophones) but different selectional restrictions. Then verb 
phrases could encode satisfied 2-CNF clauses: see! would be false and se¬ 
lect a true argument; conversely, see 2 would be true and select either a 
true or false argument. The consequence is that any verb phrase headed 
by see would contain a word representing a true literal. We could even get 
two literals of the same variable to agree on truth value by moving one to 
the subject position of the other, where they must agree, exactly as in the 
passive construction: the underlying object moves to the subject position, 
where it must agree with the auxiliary verb. Then if words were Boolean 
literals, it might even be possible to encode 3SAT instances in sentences. 


5 Complexity of Move-a 


Transformational derivations have played a central historical role in gener¬ 
ative linguistics. In each case, transformational models have been proven 
intractable (Peters and Ritchie, 1973; Rounds, 1975). To investigate whether 



17 


/ 


current transformational models, based on the generalized move-a transfor¬ 
mation, share the computational characteristics of their ancestors, we define 
a natural problem posed by our language ability and determine its com¬ 
plexity according to the linguistic model. Consider the lexical resolution 
problem, a subproblem of language perception: 

Lexical Resolution Problem (LRP) 

Given an S-structure S with ambiguous or underspecified 
words, and lexicon L containing ambiguous words, can the 
words in 5 be found in the lexicon LI 


Theorem 3 The LRP is NP-hard in transformational models with move-a 


Proof. The proof is by a reduction from 3SAT to LRP; it begins with 
a sketch of the proof idea along with a simple example. I then state the 
essential characteristics of the transformational models necessary for the 
reduction to succeed. 

The input to the reduction is a 3-CNF formula F containing clauses C'\, (? 2 , 
■ • -, Cp i n the Boolean variables x %,..., x n . Each clause contains exactly 
three distinct literals labeled by Ci = (<Zj V 6 , V c;). The output is a lexicon 
L and an S-structure 5 containing underspecified words such that the words 
in S can be resolved found in L if and only if F is satisfiable. 

The first step is to create a D-structure that represents F , where the ith con¬ 
stituent represents the ith formula literal and selects the i + 1 th constituent 
(see figure 2). The selectional properties of constituents will ensure that 
each 3-clause contains at least one true literal, although lexical ambiguity 
will prevent us from knowing which literals in the 3-clause are true. To do 
this, the first literal a, of a 3-clause C x must promise to make the 3-clause 
true, either by being true itself or by selecting a literal that promises to make 
the 3-clause true; to fulfill its promise, the second literal b{ must either be 
true or select a true literal: 



true 



true 

false 


c, true 


(If ai is true, it selects the next literal 6 , with any truth value.) 
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Given variable truth values, affixes listed in the lexicon determine literal 
truth values; they will negate or preserve variable truth values, according to 
F (see below). 

Then, scanning from right to left, each literal moves to the specifier position 
of the closest literal of the same variable, either by long movement or by 
successive cyclic movement (see figures 3 , 4 ). 

In the resulting S-structure, the specifier position of the ith occurence of a 
variable contains the i + 1 th occurence of the same variable, and agrees with 
it by specifier-head agreement: all but the first occurence of a variable are 
contained in the specifier position of the first occurence of that variable (see 
figure 4). 

Now, by specifier-head agreement at S-structure, all variables have consis¬ 
tent truth assignments, and by D-structure selection all clauses contain a 
true literal, where negation is performed by affixes. The formula is satisfiable 
if and only if the corresponding D- and S-structure are well-formed) | 


Requirements on the transformational model. Each formula literal 
is represented by a constituent with the following characteristics: 

1 . Transparent to extraction. The construction must permit successive 
cyclic movement (typically adjunction) between bounding nodes in 
order to satisfy the sub jaceny condition of bounding theory. 

2. Contains a landing site that agrees with the constituent head. The con¬ 
stituent will contain a specifier position; the head of the constituent 
will agree with the specifier position and assign case to it; the land¬ 
ing site will be limited to literals of the same variable by identifying 
agreement features on the head. 

3. Undergoes obligatory movement. The constituent will be a properly 
governed argument assigned a theta-role but no case; correspondingly, 
it must contain a properly governed caseless position that it selects 
and assigns a theta-role. 

4. Selectional properties are correlated with agreement features. Each 
constituent will contain an element that selects the constituent rep¬ 
resenting the next literal. Local affixation rules will morphologically 
merge the head of the construction with this element, thereby corre¬ 
lating selectional properties with agreement features in the lexicon. 
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Figure 3: Then the sixth constituent (c 2 ) moves to the specifier position of the 
third constituent (c 2 ), leaving behind a trace t,. This transformation relates the 
c 2 literal of the second clause in the 3-CNF formula to the c 2 literal of the first 
clause. (This example assumes constituents transparent to movement.) Now both 
constituents agree, by specifier-head agreement; therefore, the corresponding literals 
of the formula variable c will be assigned the same truth value, even though they 
appear in different clauses. 



Figure 4: The end result is an S-structure where specifier-head agreement ensures 
that all instances of a variable receive the same truth value. 









6 Complexity of Move-a with Trace-Deletion 


A central question for current transformational theories of syntax, such 
as the trace-deletion approaches of Lasnik and Saito (1984) and Chomsky 
(1986), is what are the consequences of interacting agreement relations, such 
as specifier-head agreement, head-head agreement, head-projection agree¬ 
ment, and the various forms of chain agreement (link, extension, composi¬ 
tion)? This question is especially important because although these forms of 
agreement are explicated in distinct definitions, they are all performed with 
the same mechanism of coindexing. Thus, these models claim that there is 
only one form of agreement, disjunctively determined by the interaction of 
complex rules and principles. 

In the Barriers model of Chomsky (1986), blocking categories (BCs) stop 
unbounded application of move-a. Informally, a BC is a category not 0- 
marked by a lexical A 0 . For example, matrix verb phrases are BCs because 
they are selected by the nonlexical category 1° (inflection) without being 
assigned a 0-role. Unbounded A-movement becomes possible when a cate¬ 
gory is moved local steps, adjoining to intermediate nonargument positions 
before moving on (adjunction is typically to BCs). As in GPSG, this notion 
of strict phrase structure locality does not suffice to prevent computational 
intractability unless local agreement (specifier-head, head-head) can be de¬ 
coupled from nonlocal agreement (chains), which may be impossible. 

Theorem 4 The LRP is NP-hard in the Barriers model. 

Proof. The idea of the proof is from theorem 3 above; although the idea 
is simple, the actual details are quite complex. On input 3-CNF formula F, 
the reduction will create a lexicon L and a D-structure that represents F , 
then apply move-a, and finally insert underspecified lexical forms to derive 
an incomplete S-structure that can be completed according to the lexicon 
L if and only if F is satisfiable. The crux of the reduction is to represent 
each literal with the noun complement structure [ N [ I VP }] in (I) (for 
example, desire [to visit placesJ): 

[ffPi ■ ■ ■ bfPi x liPi M i [v Pl v NPi+i ]] ]] (i) 

The Barriers model endows this construction with the required characteris¬ 
tics: 
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1. Transparent to extraction. NP, + 1 can be moved out of NP; in the 
structure (1). VP is a BC and barrier for NP l+i because it is not 
L-marked, but NP ; +1 can adjoin to the nonargument VP and void its 
barrierhood because nonarguments may be freely adjoined to. Both 
NPi and IP; are L-marked, and therefore are neither BCs nor barriers 
for further NP,+i raising. Thus, NP , +1 can be A-moved to any c- 
commanding specifier-of-IP position [e] without violating the ECP be¬ 
cause all traces are properly governed (both ^-governed by the verb V 
that selects NP I+1 , and 7 -marked (antecedent-governed) by the deleted 
trace adjoined to VP). 

Reinhart (personal communication) suggests a similar, albeit marginal, 
natural example where an NP containing an argument trace is topi- 
calized to CP specifier from an L-marked position : 3 

(7) ? [What burning t i+ 1 ], did John say [of what book]; + i [f t would 

be magnificent] 

2 . Contains a landing site that agrees with the constituent head. The in¬ 
ternal IPj contains a specifier position (landing site) that will agree 
with I by specifier-head agreement in nonlexical categories; the spec¬ 
ifier position will also agree with N (the constituent head), by predi¬ 
cation. Alternately, head movement from V to I to N can create an 
inflected noun /]v [1 V I] N] in the X° position of NP t that will agree 
with the landing site. Although I cannot find a natural example of 
such an inflected noun, no arguments or analyses exclude it in princi¬ 
ple. A close natural example is norm incorporation in Mohawk verbs 
(Baker 1985:139). 

3. Undergoes obligatory movement. V assigns a 0-role but no case to 
the NP i+1 position, requiring NP ; +1 to move. This is possible if V 
has lost its ability to assign case (passive morphology) or if NPj+j is 
the underlying subject of VP,, as in many current accounts (Sportiche 
1986, Fukui 1986, Larson 1987, et cetera). 

J Chomsky (pc) suggests that the correct analysis of (7) is 

(6) [what burning], did John say [[ t, of what book] would be magnificent] 

and that a better topicalization example might be What burning did John say (that) of 
that book, Mary thought would be magnificent. 
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4. Selectional properties are correlated with <p^features. V undergoes oblig¬ 
atory head movement to the affix I, creating an inflected verb in the 
head of IP. As noted above, the ^-features will appear on the inflected 
verb by specifier-head agreement, where they may be systematically 
correlated with the verb’s selectional properties in the lexicon. 

Details. The input to the reduction is a 3-CNF formula 

/ = («llUl 2 Ul 3 ), (U21U22U23), • • • , (UnlUn2Un3) 

with n clauses, 3n literals, and variables qi...qm- As before, the reduction 
will create a D-structure that represents F, and then apply move-a and 
lexical insertion to derive the S-structure S. The reduction output will 
consist of an S-structure 5 containing underspecified words and a lexicon L , 
such that the words in 5 can be resolved according to L if and only if F is 
satisfiable. 

The reduction algorithm consists of five steps: 

1. Create the two \p-features VBL and TRUE 

The m- ary VBL feature identifies the formula variables: the formula 
variable qk is associated with the feature [VBL Jfc]. The final binary 
^-feature TRUE denotes the truth assignment to the given variable. 

2 . Create the D-structure representation of f. 

For each literal u,_, in the i th clause, create the D-structure noun com¬ 
plement construction: 

(®) [w,; [ip* foil Ui [vp tj Vij NPij+i ]] ] 

where NP^ represents the subformula Ui 3 ... u n3 ; NP U+1 represents 
the subformula u;_, +1 ... u n3 ; and NT bears the ^-features of the vari¬ 
able qk that corresponds to the literal Uij. 

3. Apply head-movement within noun complements. 

Move V,j to lij, and then move the inflected verb [V,j I tJ ] to N,_,: 


(9) [np %j Nn [i PiJ fan, [ VPtj V tJ NPij+i ]]} 



25 


( 10 ) [Wy // VI] Nij] [ IPij [eijj t [yp tj t NP t j + i ]] ] 

As a result, all A 0 positions other than N° positions have traces in 
them; N° positions are filled with inflected nouns of the form [[V I] 
N]. (The goal of this operation is to force agreement between the verb 
and noun; this could also be done with predication.) 

4 . Apply long distance NP-movement across noun complements. 

Starting with the rightmost, innermost NP (NP n 3) and scanning left 
and up, move NP,-j to the specifier position je; m ] of the closest inflection 
I im that agrees with it in ^-features: 


( 11 ) 


• ' • IvPlm [[ V V N lmJ [lP lm fa m ] t [ V p lm 
I] [ IPi] [eij] t [ VPt] t NP{j + i 1] ]] ]] } 


1 yPlm + l 


■ • [tVPy [[V 


( 12 ) • • • lwp lm [[V I] N lm ] [i Plm [NPij] t [ VPlm t [ NPlm ^ ... Uj] ]]} 

The leftmost literal of each formula variable is assigned case in situ 
by the formula literal that selects it; all other literals of the variable 
receive a theta role but no case, which is what forces them to move. All 
literals of a variable but the rightmost one assign case to their specifier 
position, so that a literal will be able to land there. Our third ally in 
making the movement obligatory is the extended projection principle, 
which requires specifier of IP to be filled. 

Movement is by adjunction to intermediate VPs, deleting intermediate 
traces. This movement satisfies the ECP because all traces are prop¬ 
erly governed and therefore 7-marked at S-structure (both 0 -governed 
by the verb Vij that selects them, and antecedent-governed by the 
deleted trace adjoined to VP). The movement satisfies minimality triv¬ 
ially because the deleted trace is not excluded by VP. Because NPs are 
only moved to specifier positions that c-command their bound traces, 
the resulting structure also satisfies the chain condition and binding 
theory when the chain is created. Note that the movement is per¬ 
missible independent of whether we express the antecedence relation 
induced by move-a using indices or links (cf. Higginbotham, 1983 ). 

Now NP^ agrees with its c-commanding NP/ m on the TRUE feature by 
specifier-head agreement in IP and head-head agreement between I; m 
and the head of NP/ m . 
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5. Perform (underspecified) lexical insertion. 

Insert the morphological form aj0j 19 ^ for the inflected noun [[V I] N,-_y] 
when Uij — qk (that is, is not a negated variable); when Uij = qf, 
insert the morphological form aW'T^fc instead. 

( 13 ) • • • hfP lm HV 11 AW [ lPlm [NPijJ t [ VPlrn t NP lm+1 ]] } 

(14) ... [ NPtm [aififftpk] [ip lm [ctifirfiptj t [ V p lm t NPi m+ i ]] ] 

Note that each ‘oj 0 j' is a listed word; that 7 and 7 are suffixes; and 
that ifk is the morphological realization of the ^-feature [VBL k]. 

Further note that the surface structure is a string of inflected words. 
For example, on input / = <hq 2 <l 3 <l 2 <h<l 3 the reduction algorithm would 
yield the surface string or linguistic expression: 

Ol0\19l 01202191 0-202192 0x01192 0303193 Ct303j93 

We now turn to morphological and lexical details of the reduction, namely 
how we ensure that each clause has one true literal, that literals and variables 
may be true or false, and that negation changes truth values. 

The lexicon contains ambiguous inflected norms [[V I] N] that have un¬ 
dergone verbal incorporation. We define the morphological primitives and 
lexicon to ensure that the N-complement structure NP tl for the first literal 
in the i th clause u tl either (a) corresponds to a true literal, or (b) selects a 
true literal (as represented by) NP^ or (c) selects a false literal NPi 2 that 
will select a true literal NP, 3 . Similarly, the N-complement structure NP ,2 
for the second literal in the i‘ h clause u l2 must either (a) correspond to a true 
literal, (b) correspond to a false literal, or (c) correspond to a false literal 
and select a true literal NP, 3 . Finally, the structure NP l3 for tt, 3 , the third 
and final literal in the i th clause, may correspond to a true or false literal. 

This lexical system requires one ^-feature TRUE on nouns to encode variable 
truth assignments; one non- 9 -feature LITERAL to encode literal truth values 
on both norms and verbs; and one non-v?-feature T on nouns that verbs can 
select for. Recall that we are assuming that the m-ary ^-feature VBL will 
be morphologically realized on the nouns in the lexicon, and will distinguish 
the formula variables: N ij and N/ m bear the same VBL value if and only if 
U{j and ui m are literals of the same variable. 
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The morphological system is constructed assuming the relativized head def¬ 
inition of DiScuillo and Williams (1987). There are six ambiguous morpho¬ 
logical primitives: the prefixes ai,a 2 ,a 3 and roots 0\,02i03- Together they 
define three ambiguous listed words oudi, a 2 /3 2 , and a 3 /3 3 , which may each 
combine with either of the two nominal suffixes 7 , 7 . 


Listed Word 

Prefix (a,) features 

Root (A) features 

CClAl 

N [LITERAL l] 

V [LITERAL 1] 

V[LITERAL 0,Select T] 

N [LITERAL 0] 

01202 

N [LITERAL 1,T] 

V [LITERAL 1] 

V[LITERAL 0] 

N [LITERAL 0] 

V[LITERAL 0,Select T] 

N[LITERAL 0.T] 

Ot303 

N [LITERAL 1,T] 

V [LITERAL 1] 

V [LITERAL 0] 

N [LITERAL 0] 


The suffixes relate variable truth assignments to literal truth values. For 
example, 7 will only be lexically inserted to represent a negated variable, 
and therefore inversely relates the truth value of the variable and literal. 


Suffix 

Feat ures 

Selects 

7 

N[TRUE 1] 

[LITERAL 1] 

N[TRUE 0] 

[LITERAL 0] 

7 

N[TRUE 1] 

[LITERAL 0] 

N[TRUE 0] 

[LITERAL 1] 


Finally, there are m listed inflectional affixes corresponding to the m possible 
values of the VBL ^-feature. This completes the presentation of the reduction 
algorithm.! I 


Comments. The proof exploits a flaw in the A-system of local ^-relations 
described in Barriers that arises because distinct linguistic relations (head 
chains, NP-chains, and specifier-head agreement) are conflated into the re¬ 
lation of A-chain coindexing. Abstracting from the details of the preceding 
reduction, it is very clear that complex, undesirable interactions can arise 
when all agreement is performed by coindexing, as it is in both trace-deletion 
models considered above. Linguistically, this corresponds to conflation of 
‘local’ relations, such as specifier-head agreement, and ‘nonlocal’ relations, 
such as long A-movement. 

The lexicon in the reduction above strongly correlates ^-features with se- 
lectional restrictions. In effect, words can indirectly select the selectional 
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restrictions of their complements. In my view, this is perfectly reasonable: 
it is well-known that the properties of words in natural lexicons are not sta¬ 
tistically independent, and statistical correlation is perfectly natural, even 
expected, in any listing of idiosyncratic properties (that is, a lexicon). For 
example, it is possible to imagine a natural language where the ^-features 
on the subject are correlated with the object’s ^-features. In brief, the syn¬ 
tactic and lexical models permit this kind of correlation, it may be used in 
complexity proofs. 

The most serious question is whether the complexity of the lexical resolution 
problem accurately reflects on the complexity of language perception as a 
whole. The S-structure built by the reduction is very complex and I have not 
established that it is uniquely determined by its phonological form. If not 
uniquely determined, then hearers would be free to perceive some simpler 
S-structure with the same phonological form as the intractable S-structure 
built by the reduction. If this were possible, then the lexical resolution 
problem would no longer be a realistic subproblem of linguistic perception. 

Theorem 5 The LRP is NP-hard in the Lasnik-Saito model. 

Proof. The preceding proof proceeds without alteration in the Lasnik-Saito 
model because in that model, ^-government suffices for proper government, 
and traces may be deleted at any level after 7 - mar king j | 

0.1 Blocking the Barriers reduction is difficult 

How might we change the Barriers model in order to block the preceding 
reduction? The Barriers model appears to crucially rely on the assumed 
atomicity of the two nonlexical categories C and I, which together form a 
barrier to argument movement. If C assigns nominative case to specifier-of- 
IP, as some authors have proposed (Bennis 1980, Dasgupta 1985), then CP 
and IP form an atomic unit, a barrier to antecedent government. Although 
this would stop the exact construction used in the reduction, it would also 
leave the structure of the infinitival noun complement construction unex¬ 
plained. A second, theory-internal, problem with this change is that now 
both C and I must govern the specifier of IP (C in order to assign case, and 
I for specifier-head agreement), which violates the minimality condition. 

The preceding proof relies on long movement of the NP complement of a verb 
(in a noun complement construction), which is precisely what Barriers strives . 
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to prevent by reducing proper government to antecedent government, using 
the Lasnik-Saito 7 -marking mechanism. (The commitment to eliminating 
^-government from proper government is tentative at best. The strongest 
position taken is “Possibly, a verb does not properly govern its 0-marked 
complement,” p.79). In the Barriers reduction, an argument undergoes long 
movement by adjoining the argument NP to VP, 7 -marking its trace, and 
then deleting the intermediate A-trace at LF. 

This is the exact derivational sequence (adjoin, 7 -mark, delete adjoined 
trace) used in Barriers (pp.21-22) to move a wh-subject from a 0-marked 
CP complement to a specifier of CP, provided the wh- phrase is licensed at 
LF. Barriers attempts to exclude similar long movement of an NP from a 
similar (but caseless) subject position by appeal to Binding Theory condition 
C at S-structure: the NP trace in subject position would be an A-bound 
R-expression A-bound in the domain of the head its chain (p.93, fn.20). 
(Barriers differentiates the two constructions solely by the nature of their 
traces: wh- traces are not R-expressions, while NP-traces are.) Crucially, 
Chomsky’s argument holds only if the trace deleting ability of affect-alpha 
is restricted to LF, contrary to the Lasnik-Saito model: otherwise, adjoined 
traces could be deleted before causing an S-structure binding violation. 

But trace deletion cannot be restricted solely to LF. If it were, then any 
ECP violation created by LF-movement may be avoided, simply by deleting 
offending intermediate traces after they have done their 7 -marking duty. 
This can be done because adjoined A-traces axe not required by the extended 
projection principle at LF. 

Even if this conundrum could be resolved, another one awaits us: the long 
movement used in the proof is applied cyclically, so that the trace of the argu¬ 
ment NP is no longer c-commanded by the argument NP once all movement 
has applied, and hence not A-bound by the head of its chain at S-structure. 
This brazen violation of the c-command condition on chain links is standard 
in the literature, and hence does not raise any special problems here. Ex¬ 
amples include the topicalization example (7) above, antecedent-contained 
ellipsis, 

(15) [Everyone that Max wants to e 2 ]i John will [kiss ei ]2 
and passive VP topicalization in English: 

(16) [yp Arrested t{ by the FBI]_, John, has never been fj] 
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Furthermore, even if trace deletion were disallowed entirely, long movement 
would still be possible from ^-marked noun complements, and the proof 
would proceed, because ^-government cannot be eliminated without disas¬ 
trous consequences in the rest of the theory. Proper government can be 
reduced to antecedent government only if antecedent government suffices 
for NP-movement (eg. passive and raising) in accordance with the chain 
extension operation. This fails because only the terminus of an (extended) 
A-chain may theta-mark or case-mark, in order to obtain the CED effect 
(Condition on Extraction Domains, see Barriers, p.72). Therefore, in pas¬ 
sive constructions, where the A-chain headed by the subject NP must be 
extended to include the verb and inflection and thereby achieve antecedent 
government of the NP-trace at S-structure, the inflection will simultaneously 
loose its ability to case-mark the subject position. The direct consequence 
is that Barriers incorrectly predicts both passives in (17) violate the case 
filter and are ungrammatical, whereas only (17a) is ill-formed. 

(17) 

a. * [ej was killed John 

b. John was killed t 

In short, the chain extension required to satisfy the ECP without theta- 
govemment will prevent the subject NP from receiving case, and thereby 
violate the case filter. This very serious problem may be remedied by aban¬ 
doning either (1) the case filter, which would without question be disastrous 
for the theory, (2) the Barriers analysis of CED effects, which would reduce 
empirical coverage, or (3) the coindexing/chain extension analysis of NP- 
movement, which will have the direct consequence that proper government 
cannot be reduced to antecedent government. 

Additional evidence against the coindexing/chain extension analysis of NP- 
movement comes from gapping and VP-deletion, which raises a contradiction 
for the Barriers chain extension account of simple passives: (1) each passive 
conjunct in (18) contains a distinct R-expression subject that therefore must 
head distinct A-chains with differing indices, but (2) both extended A-chains 
share the same verb, and therefore must share the same index. 



John; was given t t records and Suej [e] books 
John, was [killed t t } k but Tomj wasn’t (e] fc 
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The possibility of long argument movement by adjunction to intermediate 
positions remains in Chomsky’s most recent theory of derivation, where he 
proposes that derivations be subject to a ‘least effort principle,’ with the fol¬ 
lowing provisions. LF permits only the following five elements: arguments, 
adjuncts, lexical elements, predicates, and operator-variable constructions. 
Affect-alpha must apply at LF to each illegitimate object to yield one of 
these five legitimate elements. Chomsky (1988b:20) urges us to “consider 
successive-cyclic A-bar movement from an argument position. This will yield 
a chain that is not a legitimate object, and that can become a legitimate 
object, namely an operator-variable construction, only by eliminating inter¬ 
mediate A-bar traces. We conclude, then, that these must be deleted by LF.” 
A parallel consequence of this theory of derivations is that successive-cyclic 
A-bar movement from a theta-marked argument position to a case-marked 
argument position will also yield an illegitimate object, that can become 
a legitimate object, namely an A-chain, only by eliminating intermediate 
A-bar traces by LF (that is, before LF). 
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7 Controlling complexity in systems of knowl¬ 
edge 

The construction used in the preceding reductions may appear to arise more 
from inconsistancies in the transformational model than from human lan¬ 
guage. But, as I argued above, no simple change to Barriers can block 
the reduction while maintaining the theory’s intended empirical coverage, 
because every property of the reduction construction is independently jus¬ 
tified in the theory. More accurately then, the reduction construction is 
simply a complex, unnatural combination of the properties of actual natural 
constructions. 

In order to explain complex phenomenon, generative linguists invariably 
construct complex theories that are difficult to change and computationally 
intractable. In my opinion, this problem is inherent in the attempt to con¬ 
struct a system of knowledge. If you know about long movement, agreement, 
and head movement, then you also know all possible combinations of those 
linguistic processes. Nor is a cost assigned to employing those combinations 
or determining the consequences of knowledge. Theories of knowledge tire 
inherently non-modular and maximally interactive—they lack techniques 
for controlling complexity—and this is why it is so difficult to construct a 
complex system of knowledge. This is the same critical flaw that doomed 
cybernetics. 

7.1 Taming the complexity of simple operations 

Technically, the preceding proofs have shown that simple local linguistic op¬ 
erations in transformational grammars (including local agreement and move- 
a bounded by the ECP) can have complex consequences. How might the 
complexity of simple linguistic operations be reduced? Interactions among 
operations must be tamed. As things currently stand, this will be difficult to 
do for the transformational models considered above because many concep¬ 
tually distinct relations (forms of syntactic agreement) are uniformly rep¬ 
resented via coindexing, and this maximizes interactions among principles. 
Therefore, the first step is to segregate distinct relations in representation. 
This approach has led phonologists from the solely segmental representa¬ 
tions of SPE to nonlinear models (autosegmental, prosodic). In syntax, 
the potentially distinct anaphora-antecedent, chain link, and specifier-head 



agreement relations would be best represented by distinct relations, rather 
than by the uniform co-indexing mechanism. Representational theories of 
movement (eg., g-projection, dynasty formation, chain formation algorithm) 
might strive to decouple principles on the verge of orthogonality, such as 
binding and government in Rizzi’s relativized minimality approach. 

The second step is to more clearly distinguish linguistic representations from 
the abstract process whereby they are specified. This necessity of this con¬ 
ceptual distinction has been clear since Descartes first argued for it in in¬ 
troductory philosophy. In generalized phrase structure grammar, the local 
tree representation is sharply distinguished from the process of specifying 
the local trees (metarule finite closure, and ID rule projection according 
to universal feature instantiation, et cetera). The distinction is s imi larly 
clear in representational approaches to syntactic movement: chain link rep¬ 
resentations are specified by an abstract process of iteration (g-projection, 
dynasty formation, or chain formation algorithm) whose intermediate states 
are distinct from its output. But in transformational approaches, the fun¬ 
damental distinction between representation and process is blurred, because 
the chain link representation is simply the history of the process of applying 
the move-a transformation. In such a model, it is unclear if the difficulty of 
constructing the linguistic representation is inherent, or merely an artifact 
of an unnecessarily complex specification process. 

Neither representational segregation nor a sharper distinction between pro¬ 
cess and representation entails a reduction in the complexity of simple prin¬ 
ciples. But both steps would certainly clarify the fundamental computa¬ 
tional structure of linguistic theory; hopefully, this clarification will lead to 
an improved understanding of the inherent complexity of simple linguistic 
principles. 
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8 Syntactic Binding 

Every child knows that pronouns stand for nouns, and that speech would 
soon become extremely difficult without them. We would soon grow tired of 
repeating the nouns pronouns stand for over and over. Nor could we easily 
introduce ourselves to fellow travelers and proceed to discuss our respective 
destinations. 

Consider the problem of determining pronoun antecedents. Two pronouns 
can share an antecedent only if they agree and they are not too close. Thus, 
the pronoun her cannot stand for Bill in the sentence Bill saw her mother 
because her is not masculine gender. And the sentence Bill knew he liked 
him cannot mean ‘Bill knew Bill liked Bill,’ unlike the sentence Bill knew he 
liked himself , which could mean that. In effect, pronouns are competing for 
antecedents: in some cases, two pronouns may share an antecedent, and in 
the other ‘disjoint reference’ cases they may not. Exactly how hard is this 
task? 

The interactions among pronouns cam become impossibly complex when 
there are many pronouns and only a few possible antecedents. In fact, 
determining pronoun antecedents is like coloring the nodes of a graph, where 
possible antecedents are colors and pronouns are the nodes of a graph whose 
edges represent disjoint reference conditions. This correspondence proves 
that the pronoun antecedent problem, like the NP-complete graph coloring 
problem, cannot be solved in practice by any known method. How then can 
we understand sentences containing pronouns? 

8.1 Binding at S-structure 

Binding theory is concerned with the permissible linguistic antecedents of 
anaphora, such as pronouns, reflexives, and reciprocals. Binding theory 
states conditions on binding relations between A-positions (positions as¬ 
signed a grammatical function, such as SUBJECT or OBJECT) at S-structure 
and LF. Chomsky (1981.T88) formulates the binding theory as follows: 

( 19 ) A , . , 1 • * 

a. An anaphor is bound in its governing category. 

b. A pronominal is free in its governing category. 

c. An R-expression is free everywhere. 



(Arguments are bound by c-commanding antecedents, else free; a governing 
category for a is the minimal NP or S containing a and a governor of a.) Two 
additional binding conditions are (1) anaphora must find their antecedents in 
some level of linguistic representation, such as S-structure, LF, or discourse 
structure, and (2) anaphora that share an antecedent must agree with each 
other and with that antecedent on agreement features. 4 Occasionally, a 
third condition, the i-within-i condition , is cited to prohibit categories from 
being antecedents of anaphora they dominate. 

A syntactic binding is a set of anaphor-antecedent relations that satisfies the 
binding theory, such that all anaphora have S-structure antecedents. The 
binding problem is: 

Binding Problem (BINDING) 

Given a syntactic representation 5 lacking only binding 

relations, construct a permissible syntactic binding for 5, 

according to binding theory. 

The corresponding decision problem is to decide if a given syntactic repre¬ 
sentation has a permissible syntactic binding. The all bindings problem is 
to enumerate all permissible syntactic bindings for a given syntactic repre¬ 
sentation. 

Theorem 6 The binding decision problem is NP-hard. 

Proof. The proof will be by a trivial reduction from GRAPH fc-COL OR ABILITY 
to BINDING. On input graph G = (V, E) with vertices vi...v n , we need 
n binary agreement features, n pronouns, and k R-expressions. Pronoun p l 
represents vertex vp. for each edge (u;, vj ) from pronoun p, has <fi = 0 
and pronoun pj has <pi = 1. In the S-structure, the ith R-expression Ri 
c-commands the i -(- 1th R-expression R,+i, and each pronoun is free in its 
governing category, where it is c-commanded by all R-expressions, which are 
its potential antecedents. 

4 The transitivity of agreement is trivially demonstrated by the ungrammaticality of 
Every student, prepared heri plan and did htsi homework , where each pronoun agrees with 
its operator antecedent, but the two pronouns disagree with each other, and though neither 
pronoun is the other’s antecedent, the fact that they disagree through their common 
antecedent nonetheless excludes the example. 
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Each pronoun must ultimately have an R-expression antecedent (to satisfy 
the binding theory), which will c-command it (by construction). All R- 
expressions have disjoint reference by binding condition C, and therefore 
there are exactly k distinct possible antecedents. If there is an edge between 
two vertices in the graph, then the two corresponding pronouns cannot share 
an antecedent without disagreeing on some agreement feature; there are only 
k possible antecedents (the R-expressions), and therefore each permissible 
binding for the S-structure corresponds to a fc-coloring of the graph G. 1 I 

Comments. Crucially, the reduction proves that it is the combination of 
disagreement, binding conditions B and C, and prohibitions against circular 
dependencies that creates inherently complex disjoint reference graphs. (The 
disjoint reference graph in the preceding reduction was created entirely via 
feature disagreement simply because that was the simplest reduction.) 

Although the reduction’s extensive use of agreement features does not in¬ 
crease our understanding of natural language, it does offer greater insight 
into linguistic theory. It should be clear that agreement features, a much 
neglected part of linguistic theory, are surprisingly powerful. A reduction 
based on the other disjoint reference conditions (binding conditions B and 
C, and i-within-i), with perhaps a constant number of agreement features, 
might offer better insights into binding phenomenon. 

The other weakness is that the reduction requires an unnaturally arbitrary 
specification of agreement features on pronouns (see below). Functionally, 
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pronouns andanaphors serve to simplify linguistic expressions, and therefore 
one would expect, contrary to the reduction construction, that pronouns and 
anaphors specified for different agreement features would also have different 
antecedents. But this weakness is ultimately attributable to a failure of 
current linguistic theory to view anaphora from this perspective. 

The great strength of this reduction is that it is based directly on a natu¬ 
ral syntactic construction, whose pronouns must find antecedents in order 
for the construction to be understood. Therefore, the binding problem is 
a necessary subproblem of language comprehension. It also raises a sig¬ 
nificant computational question for the binding theory, namely, is binding 
really so much like graph coloring, a problem which is hyper-exponentially 
ambiguous and cannot be efficiently performed by any known method, even 
approximately (Lawler, 1976)? 

8.2 Limiting disagreement with the transparency condition 

The most unnatural characteristic of the binding construction used in the 
preceding reduction from graph coloring is that two pronouns with entirely 
different features could nonetheless share an antecedent, provided that they 
did not disagree on any common feature. That is, two pronouns could share 
an antecedent if their agreement features could be unified: they need not 
agree exactly, but they must not have different values for the same feature. 
In natural language, it seems that an argument may bind anaphora that axe 
virtually identical, modulo case and affix (for example, the anaphoric suffix 
-self). Let us say that two anaphora are transparently identical if they have 
the same root morpheme and agree exactly on agreement features, although 
they may have different cases or affixes. This relation partitions anaphora 
into equivalence classes, which we may call transparency classes (t-classes). 
For example, he, him , and himself are transparently identical, and form one 
t-class in the English lexicon. 

In light of these observations, I propose an additional binding condition, 
which limits interactions among t-identical anaphora. 


Transparency condition 

All anaphora with the same antecedent must be t-identical. 


The transparency condition explains why, in some dialects of English, a 
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masculine singular antecedent can independently bind the pronouns his and 
their , but cannot simultaneously bind both pronouns in the same structure: 


^ ^a. Every boy; did his, homework. 

b. Every boy, did their, homework. 

c. * Every boy, did his; homework and prepared their; lesson plan. 

( 21 ) 

a. Sue saw the dog; when it, barked. 

b. Sue saw the dog; after he; was kicked. 

c. * Sue saw the dog; when it; barked after he, was kicked. 

(Note how poor the second example is with the pronouns he and it inter¬ 
changed.) The transparency condition can also explain why disjoint refer¬ 
ence conditions only appear to hold within a t-class: 


(22) Bill; reminded Sue_, that [ gc ( ( ft e ,>) he, paid for their{,jj. tickets] 

A compound noun may be the antecedent of two t-classes, in apparent vi¬ 
olation of the transparency condition. (Of course, it is always possible to 
make up a story where the compound noun posses two indices, as in [baby 
brother j/;.) 


(23) Suzie’s [baby brother]; cried when she left it,, but soon thereafter 
he, went to sleep. 

Unfortunately, although this transparency condition successfully prevents 
the particular details of the preceding reduction, it does not in general suf¬ 
fice to limit binding interactions. Now t-classes, rather them individual pro¬ 
nouns, compete for antecedents in the presense of disagreement conditions. 
In the next section I will present a pair of binding constructions that satisfy 
the transparency condition, and then use those constructions to simulate 
QUANTIFIED 3SAT. Moreover, the disjoint reference graph arising from 
binding conditions B and C remains unaffected. 



9 Binding and Copying in Syntax 


Language users inwardly relate a complete meaning to every sound uttered 
as language: that is, for a language user, every sound is a complete utterance. 
This gives language tremendous expressive power, especially when taken in 
combination with language’s intricate underlying structure. 

For example, the sentence the men ate but the women didn’t is understood 
to mean ‘the men ate but the women didn’t eat.’ In so-called VP-deletion 
constructions, the second verb phrase is an invisible copy of the first verb 
phrase: understood, but not spoken. If the verb phrase contains a pronoun, 
that pronoun may be understood as though it were also copied, because 
the pronoun and its copy may have different antecedents. For example, the 
sentence the men ate their dinner but the women didn’t can mean ‘the men 
ate the men’s dinner but the women didn’t eat the women’s dinner.’ 

If the twin subjects of a VP-deletion construction disagreed on some feature 
(gender in the preceding examples) and the pronoun in the deleted verb 
phrase represented a variable in a Boolean formula, then such a so-called 
sloppy identity construction would represent a universal quantifier over that 
feature: the construction would be permissible only if the pronoun could 
could be independently bound by both subjects. This is possible only if the 
pronoun can agree with both (contrary) feature settings. 

A slightly different binding construction mimics an existential quantifier. In 
the sentence the men knew the women ate their dinner , either subject may 
bind the pronoun their. Continuing with the idea of pronouns as variables 
and antecedents as truth values, this binding construction represents a exis¬ 
tential quantifier over some feature, where the pronoun can be freely bound 
by either of the two antecedents that disagree on that feature. 

A third binding construction can represent 3SAT instances. The sentence 
he introduced him to him is necessarily about three people: no two of the 
three pronouns can share an antecedent. Similarly, at least one of the three 
pronouns in the sentence John thought Peter said he introduced him to him 
must refer to a third, unnamed male person. This construction can be used 
to mimic a 3-CNF clause, where at least one of the pronouns must be bound 
outside the construction, to an antecedent representing a quantified variable. 
This pronoun represents a true literal in the 3-CNF clause. 

Finally, if the VP-deletion and binding constructions are put together prop- 
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erly, they would represent quantified Boolean formulas. Therefore, deter¬ 
mining the meaning of such linguistic utterances can be as difficult as playing 
board games, such as Checkers, Chess, and Go. 


9.1 Linking and Copying at LF 

Every linguistic account of sloppy identity in VP-deletion constructions re¬ 
quires that the overt VP in effect be copied to the empty VP position, and 
that pronominal binding occur both before and after that copying is per¬ 
formed. Some representative accounts include: copying or ‘reconstruction’ 
of the VP directly at S-structure (Koster, 1987; Ross, 1967); copying in 
the S-structure to LF mapping (May, 1985); copying and interpretation in 
syntax (Wasow, 1972); or in two stages—copying an empty VP structure at 
S-structure and interpreting the antecedent VP at LF, in Sentence Gram¬ 
mar, and then copying the interpreted antecedent VP after LF, in Discourse 
Grammar (Williams, 1977). 5 Although these theories differ in the exact for¬ 
mulation of the copying rule, at what level of representation the rule applies, 
and the rule’s name, real differences are extremely minor, and do not affect 
the subsequent discussion or the proof at all. 

Without loss of generality, I will restrict my attention to binding and copying 
at LF in the T-model (for example, May 1985). In the T-model anaphora- 
antecedent relations may be determined both at S-structure and LF, while 

5 Williams (1977) proposes that Sentence Grammar contains an optional Pronoun Rule 
that replaces pronouns inside a VP with variables bound by lambda operators whose scope 
is the entire VP. For example, John saw his mother is assigned the LF [John [Xx (x saw 
[x's mother])]] when the Pronoun Rule applies, and /John [Xx (x saw [his mother])]] when 
it does not. (Note that Williams claims both LF representations have the same meaning, 
but his failure to explicitly relate the pronoun his to its antecedent in the second LF 
representation is incoherent.) Discourse Grammar strictly follows Sentence Grammar, and 
contains a VP Rule that copies entire VP-lambda expressions. Sloppy identity readings 
occur when VP copying applies to the output of the Pronoun Rule; alternate readings 
occur when the optional Pronoun Rule does not apply. Therefore, Williams’ account 
would seem to preclude free pronoun binding after VP copying. But, by Williams’ own 
arguments, the Pronoun Rule must belong to Discourse Grammar because it operates 
across sentence boundaries: 

a. Frank loves Sue. 

b. No, he only thinks he loves her. 

Therefore, Williams’ framework must also allow pronoun binding to occur both before 
and after VP copying, contrary to his claims. 
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constituent copying, as required for VP-deletion constructions, is performed 
in the S-structure to LF mapping. For example, (25) can mean either that 
the women ate the women’s dinner, or that the women ate the men’s dinner 
(see 26). 


(25) The men didn’t eat their dinner but the women did 


(26) 


a. The men, didn’t eat their, dinner but the women; ate their, 
dinner 

b. The men, didn’t eat their, dinner but the women; ate their; 
dinner 


In example (26a), binding is performed before copying, while in (26b), bind¬ 
ing is performed after copying. Conversely, example (27) does not have a 
reading where some women eat some man’s dinner. 

(27) Which men didn’t eat their dinner and which women did 

This is explained by the binding theory, which states that pronouns may 
only link to A-positions, and the LF binding conditions, which require each 
operator to c-command its variable and all anaphora linked to that variable. 
Therefore, in this VP-deletion construction, anaphora can only be bound by 
the subject of the VP that contains them at LF. 

I now make crucial use of sloppy identity in VP-deletion constructions to 
prove that the S-structure to LF mapping is extremely difficult. Informally, 
if we embed some VP-deletion constructions, and hide a parameterized com¬ 
putation in the innermost VP, we might then have to perform that compu¬ 
tation differently an exponential number of times. 

The reduction is clearest when presented assuming the linking theory of Hig¬ 
ginbotham (1983), which replaces the indices of the standard binding theory 
with directed edges (links) from anaphor to antecedent, and the stipulatory 
i-within-i condition with a more natural (semantic) prohibition against cir¬ 
cular dependencies. (An element depends on all elements it dominates, and 
on all elements its antecedents depend on.) 


Theorem 7 The S-structure to LF mapping is PSPACE-hard. 
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Proof. By reduction from QUANTIFIED 3SAT. The input fl is a quanti¬ 
fied Boolean formula in prenex 3-CNF, consisting of alternating quantifiers 
Vrx3r 2 ... Vr n _x3r n preceding (and quantifying the literals in) the clauses 
Cu C 2 , . . ., C p in the Boolean variables x\, X 2 , ..., x n . Each clause contains 
exactly three distinct literals labeled by C, = (a; V 6 ; Vc t ). The output is an 
S-structure 5 such that 5 has a permissible LF representation iff fi is true. 

Each quantifier (Vr; or 3r;) is represented by a S-structure construction 
QC, where antecedents in A-positions represent the quantified variables. 
Each Boolean clause Cj is represented by an S-structure construction CCj, 
where pronouns represent formula literals, and at least one pronoun in each 
CCj must link to some c-commanding A-position in one of the dominating 
QC; constructions. Thus, the entire QUANTIFIED 3SAT instance is rep¬ 
resented by an S-structure composed of QC and CC constructions: each 
QC; immediately contains QC t+1 ; QC n immediately contains CCi; and 
each CCj immediately contains CCj+i, as shown in figure 5. 

A universal quantifier is represented by a VP-deletion construction. In 
the VP-deletion construction QC;, the disagreeing operator-variable chains 
(0p t , [e];) and (Op;, je] ; ) represent the quantifier determined truth assign¬ 
ments Xi = 1 and Xi = 0, respectively. S-structure linking conditions require 
pronouns to only link to A-positions (in this case, to the syntactic variables 
in subject position). Feature agreement on the ID feature requires pronouns 
representing literals of r; to link to either [e]; or [e] •, and never to another 
A-position in another QC construction. Recall that linking conditions at LF 
require each operator to c-command its syntactic variable and all anaphora 
linked to that syntactic variable. Therefore, because each variable in subject 
position is locally bound by an operator, pronouns contained in the original 
VP must link to the syntactic variable [e]; linked to Op;; these pronouns 
will correspond to unnegated literals, which are true iff i; is true. Similarly, 
pronouns in the copied VP must link to the syntactic variable [e] • linked to 
Op;; these pronouns will correspond to negated literals, which are true iff 
Xi is false. 

An existential quantifier is represented by a linking ambiguity. In the lin kin g 
ambiguity construction QC t+1 , the R-expressions NP,+i and NP ; +1 repre¬ 
sent the two possible truth assignments x ; +1 = 1 and *; +1 = 0, respectively. 
Structural conditions (c-command) and feature agreement on the ID feature 
force the pronoun p * +1 to link to either NP ; +1 or NP; + j. The pronoun p* +1 
bears an additional feature, indicated by that disagrees with a feature 
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Figure 5: The S-structure in the figure represents the QUANTIFIED 3SAT in¬ 
stance Vz!3a:2 ... V* n _ 1 3* n C' l , C 2 , • • ■ C p . Each universal quantifier Vz,- is repre¬ 
sented by a VP-deletion construction. At LF, each of the n/2 circled overt VPs is 
copied to its empty VP position [vp e]. The resulting LF is exponentially larger 
than the S-structure shown. Each existential quantifier 3z,-+j is represented by a 
linking ambiguity construction. Both quantifier constructions contain permissible 
antecedents that represent possible truth assignments to their quantified variable. 
Each 3-CNF clause Cj is represented by a CCj construction that contains a selected 
pronoun that must link outside that construction, to a permissible antecedent in 
some dominating QC* construction. These obligatory long distance links are drawn 
with dashed arrows. Each selected pronoun represents a true literal in its 3-CNF 
clause. 
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Figure 6: The VP-deletion construction QCj represents the universal quantifier 
V*j. The disagreeing operator-variable chains (Op;, [e];) and (Op^, [e]J represent 
the truth assignments Xi ~ l and Xi = 0, respectively. LF linking conditions ensure 
that pronouns contained in the CC constructions may only link to the subject of 
the VP that contains them (identified by bold arrows). Therefore, only pronouns 
in the original VP representing unnegated literals of x t may link to [ej;. Simi¬ 
larly, only pronouns in the copied VP representing negated literals of *,• may link 
[e]^. The circled VP leaf node will immediately dominate the QCj + i construction 
representing 3*,+i. 
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Figure 7: The linking ambiguity construction QC i+ i represents the existential 
quantifier 3z i+1 . The disagreeing R-expressions NP i+1 and NP i+1 (identified by 
bold arrows) represent the truth assignments Zi +i = 1 and * <+1 = 0, respectively. 
The pronoun p* +1 must link to one of these two R-expressions, which it then “dis¬ 
ables” as follows. No pronoun contained in a CC construction may link to p* +1 or 
its antecedent, because of disagreement on the feature. Therefore, all pronouns 
representing literals of * i+1 must link to the same R-expression, either NP i+1 or 
NPj+i, which corresponds to assigning a consistent truth value to *,- +1 everywhere. 
The CP leaf node will immediately dominate the QC ;+2 construction representing 
Vajj+2- 
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Figure 8: The S-structure construction CCj represents the clause Cj of the 3-CNF 
formula. Only two antecedents (NPj X , NPj 2 ) are available to the three disjoint pro¬ 
nouns aj,bj,Cj in the construction. Therefore at least one of the three pronouns 
must be linked to one of the c-commanding antecedents in some dominating Q de¬ 
construction. The NPj + i leaf will immediately dominate the next CC J+l construc¬ 
tion. 


borne by all pronouns in the CC constructions. Thus, feature disagree¬ 
ment prevents both p* +1 and its antecedent (either NP t+1 or NP i+1 ) from 
binding any pronoun (representing a literal of ,c t+1 ) contained in any CCj. 
Therefore, those pronouns in CCj must be bound by the NP subject that 
Pi+i does not link to. For example, if p* +1 links to NP l+1 , then NP i+1 
may bind pronouns in any CCj that correspond to Xi + x , which is true iff 
Xi+i = 0. Otherwise, p* +1 must link to NP, +1 , and then NP; + i will bind 
those pronouns that correspond to Xi + x , which is true iff Zj +I = 1. 

3-CNF clauses are represented by linking ambiguities. In the S-structure 
construction CCj, each clause Cj of the Boolean formula is represented by 
a norm phrase NPj, and each literal of the Boolean clause is represented by 



a pronoun with a truth value that makes that literal true. (Consequently, 
pronouns representing negated literals will only agree with Op- and NPi+i 
in some dominating QC, construction.) 

Feature agreement forces pronouns p ]X and p j2 to both link to NP;+i. There¬ 
fore, no pronoun dominated by NP; +i may have NP; X , NP; 2 , Pji, or p; 2 as 
an antecedent without creating an impermissible circular dependence. The 
three pronouns u;,b;,c; c-command NP J + 1 and all it contains; therefore, 
they cannot be linked to any A-position below them. Nor may they be linked 
to each other or otherwise share an antecedent without violating linking con¬ 
dition B, because they are in the same governing category. (Nothing hinges 
on being able to stuff three pronouns in one governing category, because 
disjoint reference can always be accomplished with feature disagreement.) 
Moreover, because only two distinct antecedents (NP;t, NP; 2 ) are available 
to the three disjoint pronouns in the entire CC; construction, at least one of 
the three pronouns must be linked to — and agree with — a c-commanding 
subject outside CC;, in some dominating QC, construction. This pronoun 
represents a true literal in the Boolean clause Cj : it agrees with the subject 
it is linked to in some QC,, and is called the selected pronoun. Note that 
the CC constructions are contained inside nj 2 VP-deletion constructions 
at S-structure, and that therefore the corresponding LF representation will 
contain 2^ 2 copies of the CC, S-structure, each capable of selecting its own 
pronoun. 

The selected pronoun may represent a literal of either a universally or ex¬ 
istentially quantified variable aq. If aq is universally quantified, then the 
agreement condition ensures that only pronouns p, corresponding to un¬ 
negated literals aq and contained in the original VP at LF can link to the 
subject [e],- of the original VP. Conversely, only pronouns pi corresponding 
to negated literals aq and contained in the copied VP at LF can link to the 
subject [e] ■ of the copied VP. 

Otherwise, aq is existentially quantified, and the agreement condition ensures 
that pronouns pi corresponding to unnegated literals aq can only link to NP,, 
that pronouns p, corresponding to negated literals aq can only link to NP,-. 
Recall that if any pronoun in any CC; links to NP; (or NP;), then all 
pronouns in all CC; must link to that antecedent. 

The CC; construction is permissible iff all of its LF copies are permissible, 
which is only possible when the Boolean clause Cj contains a true literal for 
any possible quantifier-determined truth assignment to its literals, as rep- 



resented by the dominating QC constructions. Therefore, the LF represen¬ 
tation of the entire S-structure construction is permissible iff the quantified 
formula Cl is true.l | 

Note that the binding constructions of this proof can also be used to give 
a reduction from 3SAT to the binding problem, where each pronoun is no 
more than four-ways ambiguous. 

In our examination of transformational theories of syntax, it was not clear 
how the complexity of the lexical resolution problem reflected on the com¬ 
plexity of the more general language comprehension problem because it was 
conceivable, although unlikely, that the phonological form corresponding to 
the complex S-structure of the reduction could also correspond to a simple 
S-structure. If that were the case, the hearer could always choose the simpler 
S-structure, the difficult instances of the LRP would not arise in practice, 
and the complexity of the LRP would not directly bear on the complexity 
of language perception. 

But in the preceding proof, and in the proof that the S-structure binding 
decision problem is NP-hard, the phrase structure constructions are trivial: 
there is no movement, all positions are strongly identified, and all features 
are explicit on the lexical items. The lack of ambiguity in and the directness 
of the PF to S-structure mapping for these constructions provides extremely 
powerful evidence that language perception is intractable. 

A central consequence of the PSPACE-hard result is that linguistic rep¬ 
resentations in the T-model do not have efficient witnesses, unless NP = 
PSPACE. In another words, it is not possible to efficiently determine the 
grammaticality of a complete linguistic representation (in our case, a com¬ 
pletely specified D-structure, S-structure, LF, and PF). Therefore, using 
linguistic knowledge in language comprehension can be intractable. (The 
complexity of language comprehension is measured relative to the size of 
the phonological form or surface structure.) 

Theorem 8 Using linguistic knowledge is PSPACE-hard. 

Proof. A corollary to theorem 7, given the triviality and directness of 
the PF to S-structure mapping (hearing) and its inverse (speaking) for the 
VP-deletion/linking ambiguity construction of theorem 7Q 
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